PSAR-Align: improving multiple sequence alignment using probabilistic sampling
نویسندگان
چکیده
SUMMARY We developed PSAR-Align, a multiple sequence realignment tool that can refine a given multiple sequence alignment based on suboptimal alignments generated by probabilistic sampling. Our evaluation demonstrated that PSAR-Align is able to improve the results from various multiple sequence alignment tools. AVAILABILITY AND IMPLEMENTATION The PSAR-Align source code (implemented mainly in C++) is freely available for download at http://bioen-compbio.bioen.illinois.edu/PSAR-Align.
منابع مشابه
PSAR: measuring multiple sequence alignment reliability by probabilistic sampling
Multiple sequence alignment, which is of fundamental importance for comparative genomics, is a difficult problem and error-prone. Therefore, it is essential to measure the reliability of the alignments and incorporate it into downstream analyses. We propose a new probabilistic sampling-based alignment reliability (PSAR) score. Instead of relying on heuristic assumptions, such as the correlation...
متن کاملA Method of Multiple Protein Sequence Alignment Using a Hybrid Approach
Multiple protein sequence alignment is an extension of pairwise alignment to incorporate more than two sequences at a time. Multiple protein sequence alignment methods try to align all of the sequences in a given query set. Multiple protein sequence alignments are often used in identifying conserved sequence regions across a group of sequences hypothesized to be evolutionarily related. Many app...
متن کاملMultiple Sequence Alignment for Morphology Induction
MetaMorph is a novel application of multiple sequence alignment (MSA) to natural language morphology induction. Given a text corpus in any language, we sequentially align a subset of the words of the corpus to form an MSA using a probabilistic scoring scheme. We then segment the MSA to produce output analyses. We used this algorithm to compete in the 2009 Morpho Challenge. The F-measure of the ...
متن کاملParallel FSA: Improving the Performance of Multiple Sequence Alignment using a Workstation Cluster and Database
Multiple Sequence Alignments (MSA) are widely-used tools for biological sequence analysis such as function prediction and phylogeny inference. Recently, a MSA algorithm based on a statistically-sound method for model selection and parameterization, Fast Statistical Alignment (FSA), has been introduced. Although FSA is state-of-the-art with respect to accuracy and ability to scale to thousands o...
متن کاملEvolutionary inaccuracy of pairwise structural alignments
MOTIVATION Structural alignment methods are widely used to generate gold standard alignments for improving multiple sequence alignments and transferring functional annotations, as well as for assigning structural distances between proteins. However, the correctness of the alignments generated by these methods is difficult to assess objectively since little is known about the exact evolutionary ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Bioinformatics
دوره 30 7 شماره
صفحات -
تاریخ انتشار 2014